Quick Guide to Gawk Scripts
===========================

(June 2011)

atextractandtime.awk			Extracts @replies with additional time information in various formats
atextractandtype.awk			Extracts @replies and adds true/false columns indicating whether they are plain @replies or RTs
atextractfromtoonly.awk		Extracts @replies edges (from > to) only, for visualisation in Gephi
atreplycount.awk			Finds @replies from/to given users, and calculates occurrences per time period or total stats
c2t.awk				Converts CSV to TSV
explodetime.awk			Expands tweet timestamps into time information at various levels of resolution
extractimages.awk			Extracts tweets containing URLs from known image hosting services
filter.awk				Filters data, keeping only lines that include given search terms
filterinverse.awk			Filters data, removing all lines that include given search terms
filtertweets.awk			Filters data, keeping only tweets that include given search terms
filtertweetsinverse.awk		Filters data, removing all tweets that include given search terms
filterusers.awk			Filters data, keeping only tweets from given users
filterusersinverse.awk		Filters data, removing all tweets from given users
gexfattimeintervals.awk		Processes preparegexfattimeintervals.awk output into GEXF file for dynamic visualisation of @replies
hashextract.awk			Extracts all hashtags from tweets
multicount.awk			Finds tweets containing given search terms, and calculates their occurrences per time period or total stats
multifilter.awk			Finds tweets containing given search terms, and marks the occurrence of those terms in added true/false columns
nthtweet.awk				Retains only every nth tweet in the source file
preparegexfattimeintervals.awk	Prepares @reply data for dynamic visualisation in Gephi - preprocessor for gexfattimeintervals.awk
t2c.awk				Converts TSV to CSV
timeframe.awk				Retains only tweets made during a given timeframe
tweetsonly.awk			Retains only the tweet content
urlextract.awk			Extracts URLs from tweets
urlresolve.awk			Resolves extracted URLs to their final destinations - builds on urlextract.awk
urltruncate.awk			Truncates extracted URLs to their domains - builds on urlresolve.awk
wordfreq.awk				Simple count of keywords occurring in tweets
youtubeextract.awk			Extracts and standardises YouTube URLs occurring in tweets - builds on urlresolve.awk
